Cycles and clustering in bipartite networks 
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We investigate the clustering ability in bipartite networks where cycles of size three are absent and therefore 
the standard definition of clustering coefficient cannot be used. Instead, we use another coefficient given by the 
fraction of cycles with size four, showing that both coefficients yield the same clustering properties. The new 
coefficient is computed for two networks of sexual contacts, one monopartite and another bipartite. In both cases 
the clustering ability is similar. Furthermore, combining both clustering coefficients we deduce an expression 
for estimating cycles of larger size, which improves previous estimations and is suitable for either monopartite 
and multipartite networks. 
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I. INTRODUCTION 

One important statistical tool to access the structure of com- 
plex networks arising in many systems QQ] is the clustering 
coefficient, introduced by Watts and Strogatz 0] to measure 
"the cliquishness of a typical neighborhood" in the network 
and given by the average fraction of neighbors which are in- 
terconnected with each other. This quantity has been used for 
instance to characterize small-world networks to under- 
stand synchronization in scale-free networks of oscillators @] 
and to characterize chemical reactions j5| and networks of so- 
cial relationships 0] . One pair of linked neighbors corre- 
sponds to a 'triangle', i.e. a cycle of three connections. 

While triangles may be abundant in monopartite networks, 
they cannot be formed in bipartite networks 0IH0], where 
two types of nodes exist and connections link only nodes of 
different type. Thus, the standard clustering coefficient is 
always zero. However, different bipartite networks have in 
general different cliquishnesses and clustering abilities 01, 
stemming for another coefficient which uncovers these topo- 
logical differences among bipartite networks. Bipartite net- 
works arise naturally in e.g. social networks 0] where the re- 
lationships (connections) depend on the gender of each person 
(node), and there are situations, such as in sexual contact net- 
works llCfl . where one is interested in comparing clustering 
properties between monopartite and bipartite compositions. 

In this paper, we study the cliquishness of either monopar- 
tite and bipartite networks, using both the standard clustering 
coefficient and an additional coefficient which gives the frac- 
tion of squares, i.e. cycles composed by four connections. As 
shown below, such a coefficient retains the fundamental prop- 
erties usually ascribed to the standard clustering coefficient in 
regular, small-world and scale-free networks. As a specific 
application, two examples of networks of sexual contacts will 
be studied and compared, one being monopartite and another 
bipartite. 

Furthermore, we will show that one can take triangles and 
squares as the basic units of larger cycles in any network, 
monopartite or multipartite. The frequency and distribution 
of larger cycles in networks have revealed its importance in 



recent research for instance to characterize local ordering in 
complex networks from which one is able to give insight 
on their hierarchical structure fllll . to determine equilibrium 
properties of specific network models fT3l . to estimate the er- 
godicity of scale-free networks flUl . to detect phase transi- 
tions in the topology of bosonic networks fl4ll and to help 
characterizing the Internet structure 11511 . Since the compu- 
tation of all cycles in arbitrarily large networks is unfeasible, 
one uses approximate numerical algorithms fl3l Q(J fl7il or 
statistical estimations II 8l 1 1 911. Here, we go a step further 
and deduce an expression to estimate the number of cycles 
of larger size, using both clustering coefficients, which not 
only improves recent estimations fl9ll done for monopartite 
networks, but at the same time can be applied to bipartite net- 
works and multipartite networks of higher order. 

We start in Section|n]by introducing the expression which 
characterizes the cliquishness of bipartite networks, compar- 
ing it with the usual clustering coefficient. In Section ITTT1 we 
apply both coefficients to real networks of sexual contacts and 
in Section lTVl we use them to estimate cycles of larger size and 
show how it is applied to bipartite networks. Conclusions are 
given in SectionlVl 



II. TWO COMPLEMENTARY CLUSTERING 
COEFFICIENTS 

The standard definition of clustering coefficient C3 is the 
fraction between the number of triangles observed in one net- 
work out from the total number of possible triangles which 
may appear. For a node i with a number fcj of neighbors the 
total number of possible triangles is just the number of pairs 
of neighbors given by ki(ki — l)/2. Thus, the clustering co- 
efficient C%(i) for node i is 



C 3 (i) 



2t; 



ki{hi 1) 



(l) 



where ti is the number of triangles observed, i.e. the number 
of connections among the fcj neighbors. 
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Similarly to 63(1), a cluster coefficient €4(1) with squares 
is the quotient between the number of squares and the total 
number of possible squares. For a given node i, the number of 
observed squares is given by the number of common neigh- 
bors among its neighbors, while the total number of possible 
squares is given by the sum over each pair of neighbors of the 
product between their degrees, after subtracting the common 
node i and an additional one if they are connected. Explicitly 
this clustering coefficient reads 

n ,-\ Em=lEn=m+l9'( m ' n ) ,~ 
04(z) = £ t , (2) 

Em=l T,n=m+1 M m : n ) + *( m > n )] 

where m and n label neighbors of node i, qi(m,n) are 
the number of common neighbors between m and n and 
o,-(m,n) = (k m - r)i(m, n))(k n - 7?,(m, n)) with 77* (m, n) = 
1 + qi(m, n) + 6 mn and mn = 1 if neighbors m and n are 
connected with each other and otherwise. 

While Cz(i) gives the probability that two neighbors of 
node i are connected with each other, C±(i) is the probability 
that two neighbors of node i share a common neighbor (differ- 
ent from i). Averaging C^(i) and 6*4(1) over the nodes yields 
two complementary clustering coefficients, (C3} and (C4), 
characterizing the contribution for the network cliquishness 
of the first and second neighbors respectively. For simplicity 
we write henceforth C3 and C4 for the averages of Cs(i) and 
04(1) respectively. 

Figure ^ shows both clustering coefficients C3 and C4 in 
several topologies. In all cases C3 and C4 are plotted as 
dashed and solid lines respectively, and are averages over sam- 
ples of 100 realizations. As an example of regular networks, 
we use networks with boundary conditions where each node 
has n neighbors symmetrically disposed. In particular, for 
n = 2 one obtains a chain of nodes. For these regular net- 
works, Fig. ^ shows the dependence of the clustering co- 
efficients on the fraction n/N of neighbors, with N = 10 3 
the total number of nodes. As one sees C4 < C3 and for 
either small or large fractions of neighbors both coefficients 
increase abruptly with n. In the middle region C3 is almost 
constant, while C4 decreases slightly. Our simulations have 
shown that in regular networks the coefficients depend only 
on n/N, i.e. for any size of the regular network, similar plots 
are obtained. 

Figure [Qi shows the coefficients for small-world networks 
with N = 10 3 nodes, constructed from a regular network 
with n = 4 neighbors symmetrically disposed. The coeffi- 
cients are computed as functions of the probability p to rewire 
short-range connections into long-range connections and they 
are normalized as usual ^ to the clustering coefficients C3 4 
of the underlying regular network. As one sees, C4 yields 
approximately the same spectrum as the standard clustering 
coefficient C3 being therefore able to define the same range 
of p for which small-world effects are observed. While here 
the small-world networks were constructed with rewiring of 
short-range connections into long-range ones, the same fea- 
tures are observed when using the construction procedure in- 
troduced in Ref. I20I1 where instead of rewiring one uses ad- 
dition of long-range connections. 
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FIG. 1: Comparisons between the standard clustering coefficient 
C3 in Eq. Q (dashed line) and the clustering coefficient C\ in 
Eq. i|2j (solid line) for different network topologies: (a) in one regu- 
lar network with n neighbors symmetrically placed (N = 10 3 ), (b) 
in small-world networks where long-range connections occur with 
probability p (N = 10 3 and n = 4) and (c) in random scale-free net- 
works where the distribution of the clustering coefficients is plotted 
as a function of the number k of neighbors (N = 10 J and m — 2). 
In all cases samples of 10 2 networks were used. The distributions 
C3(k) and Cilk) are also plotted for (d) Apollonian networks I2TI1 
with N = 9844 nodes (•) and pseudo-fractal networks 12^1 with 
N = 9843 nodes (o). 



For random scale-free networks we plot in Fig.QJ the dis- 
tribution of both coefficients as functions of the number k 
of neighbors, using networks with N = 10 5 nodes and by 
given initially m = 2 connections to each node. Here, one 
observes that C4(k) is almost constant as k increases, repro- 
ducing the same known feature as the standard C$(k) apart 
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a scaling factor: Ci(k) / Cs(k) is approximately constant for 
any k. In Fig.QJl we plot the clustering distributions for two 
different deterministic scale-free networks recently studied, 
namely Apollonian networks pUl . represented with bullets •, 
and pseudo-fractal networks I22ll . represented with circles o. 
In both cases, the same power-law behavior already known for 
Cs(k) ~ k~ a in these hierarchical networks is also observed 
for the coefficient C^{k) with the same value of the exponent 
a. 

In short, the results shown in Fig. [2 give evidence that C4 
is also a suitable coefficient to characterize the topological 
features in several complex networks commonly done with 
the standard clustering coefficient C3. Furthermore, since C4 
counts squares instead of triangles, it is particularly suited for 
bipartite networks. Next, we will use this coefficient to com- 
pare different models for networks of sexual contacts, where 
both monopartite and bipartite networks arise naturally. 



III. CYCLES AND CLUSTERING IN SEXUAL 
NETWORKS 

In this Section we apply both coefficients C3 and C4 in 
Eqs. Q and to analyze two real networks of sexual con- 
tacts. One network is obtained from an empirical data set, 
composed solely by heterosexual contacts among N = 82 
nodes, extracted at the Cadham Provincial Laboratory and is 
a 6-month block data ll23ll between November 1997 and May 
1998. The other data set is the largest cluster with N = 250 
nodes in the records of a contact tracing study ll24ll . from 1985 
to 1999, for HIV tests in Colorado Springs (USA), where most 
of the registered contacts were homosexual. Figure|3]sketches 
these two networks, where one can see that cycles of different 
sizes appear. While the network with only heterosexual con- 
tacts is clearly bipartite, the network with homosexual con- 
tacts is monopartite. 

For the two networks in Fig.|2] Table H] indicates the num- 
ber T of triangles, the number Q of squares and the coeffi- 
cients C3 and C4. As one sees, although the heterosexual net- 
work has less squares than the homosexual network due to its 
smaller size, C4 is much larger. Another feature common for 
both neighbors is the average number of connections per node 
L/N ~ 1. 

Recently, we introduced lHoll a model to simulate the statis- 
tical features of these networks of sexual contacts. The model 
is a sort of a granular system with low density composed by 
N mobile particles representing persons and collisions be- 
tween them representing their sexual contacts. The collisions 
representing sexual contacts are governed by dynamical rules 
which are carried out by means of an event-driven algorithm, 
and are based on two simple facts from sociological observa- 
tions: (i) individuals with a larger number of partners are more 
likely to get new partners and (ii) sexual interactions do not 
determine the direction toward which each agent will be mov- 
ing afterward. Therefore, we choose a collision rule where the 
absolute value of the velocity of each agent increases with the 
number k of sexual partners and the moving directions after 
collisions are randomly selected Holl . 




FIG. 2: Sketch of two real sexual contact networks having (a) only 
heterosexual contacts (N = 82 nodes and L = 84 connections) 
and (b) homosexual contacts (N = 250 nodes and L = 266 con- 
nections). While in the homosexual network triangles and squares 
appear, in the heterosexual network triangles are absent (see Table|IJ. 
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0.02787 



TABLE I: Clustering coefficients and cycles in two real networks 
of sexual contacts (top), illustrated in Fig. [2] one where all con- 
tacts are heterosexual (Fig.|2|i) and another with homosexual con- 
tacts (Fig.|2j)). In each case one indicates the values of the number 
N of nodes, the number L of connections, the number T of trian- 
gles, the number Q of squares and both clustering coefficients C3 
and C4 in Eqs. Q and J2j respectively. The values of these quanti- 
ties are also indicated for networks constructed with the agent model 
recently introduced ITcil and for random scale-free networks. 



Using the same number of nodes as in the real networks il- 
lustrated in Fig.|2]and considering two types of nodes for the 
heterosexual (bipartite) case, we obtain with the agent model 
similar results for L, T, Q, C3 and C4, as shown in Table H] 
where values represent averages over samples of 100 realiza- 
tions. Remarkably, for the bipartite case not only the number 
of connections and the number of squares are numerically the 
same, but also C4 is of the same order of magnitude. Sim- 
ilar values of the topological quantities are also obtained for 
the monopartite case, with the exception of C4. Despite this 
difference, the agent model gives values for the topological 
quantities of clustering and cycles much more closely to the 
real ones than in random scale-free networks, commonly used 
to reproduce such empirical data sets of sexual contacts [25]. 
In Table U we also show the values obtained for monopartite 
and bipartite scale-free networks whose degree distributions 
are as close as possible from the distributions of the real net- 
works. 

To compare more deeply scale-free networks and networks 
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FIG. 3: Comparing topological features between networks obtained 
from the agent model (solid lines) and random scale-free networks 
(dashed lines), both used to reproduce one real monopartite net- 
work of sexual contacts (bullets): (a) cumulative degree distribution 
Pcum{k), (b) standard clustering coefficient Cs(k) in Eq. and (c) 
clustering coefficient d{k) in Eq. 0. Here N = 250 and samples 
of 100 realizations were used. For the scale-free network m — 2 
which yields the best results for the coefficients (see text). 



obtained with the agent model we study also the distribution 
of the number k of sexual partners and the coefficients distri- 
butions as functions of k. In Fig. [2 we plot these distributions 
for the monopartite network of sexual contacts sketched in 
Fig-Eb- while in Fig. |4] we plot the distributions for the bipar- 
tite network (Fig.|2^). In both figures bullets indicate the dis- 
tributions of the empirical data, while solid lines indicate the 
distributions of the networks obtained with the agent model 
and dashed lines indicate the distributions of scale-free net- 
works, with a minimum number of connections m = 2, which 
gives the best fit of a scale-free distribution to the empirical 
data with non-zero clustering coefficient. For both models we 
impose the same size as the real network and take averages 
over a sample of 100 realizations. 

As illustrated in Fig.|5Ji, the agent model reproduces accu- 
rately the cumulative degree distribution P cum (k) of the real 
monopartite network. For m = 1 the degree distribution of 
the scale-free network yields a better fit to the empirical data 
than for m = 2 but both clustering coefficients are zero for 
any degree k, which is not realistic as illustrated in Figs.|3p 
and|3j;. 

Figures [3j> and|3j; show the distribution of C3 and C4 re- 



FIG. 4: Distributions for one real bipartite network of sexual contacts 
(bullets) compared with the one of networks obtained from the agent 
model (solid lines) and with random scale-free networks (dashed 
lines): (a) cumulative degree distribution P CU m(k), (b) clustering 
coefficient C4(k) in Eq. J2j. Here Cs(k) = always, N = 82, sam- 
ples of 100 realizations were used for both models and scale-free 
networks have m = 2. 



spectively. Although the real network (bullets) is very small 
and therefore finite size effects appear, one may observe that 
C3 is larger than C4. Clearly, both models yield clustering 
coefficients of the same order of magnitude as the ones of the 
real networks, remaining the condition Cj,(k) > C^{k) ob- 
served for the real network. For scale-free networks the clus- 
tering coefficients are slightly larger than the ones of the agent 
model. 

Figures0t and|5> show the cumulative degree distributions 
and the distribution of C4 respectively, for the bipartite net- 
work of heterosexual contacts. Here, Cz(k) = for all k (not 
shown). As one sees, the cumulative distribution for the agent 
model yields a better fit to the empirical distribution than the 
one of scale-free networks, as seen in Fig.|4^. Notice that the 
cumulative degree distribution for the scale-free network devi- 
ates from a power-law at large values of k, since we are using 
a bipartite graph which decrease the number of the most con- 
nected nodes. Furthermore, comparing Fig. [4j> with Fig. [3J; 
one clearly sees that in both cases, C4 has approximately the 
same shape. 

It is interesting to observe that, while both models repro- 
duce at least qualitatively the coefficient distributions, in all 
cases the agent model fits more accurately the degree distri- 
bution of the empirical data. Furthermore, comparing C4 be- 
tween homosexual and heterosexual contacts one may rise the 
hypothesis that the cliquishness of both types of contacts is 
similar (see Sec.lVlbelow). 
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In the next Section we will present another application of 
the clustering coefficient C4, showing how it can be used to 
account for cycles of larger size in any network, and in partic- 
ular in bipartite networks. 



IV. ESTIMATING THE NUMBER OF LARGE CYCLES 
WITH SQUARES AND TRIANGLES 

Recent studies have attracted attention to the cycle structure 
of complex networks, since the presence of cycles has impor- 
tant effects for example on information propagation through 
the network 0[| and on epidemic spreading behavior l28ll . In 
order to avoid numerical algorithms for counting the number 
of cycles with arbitrary size which implies long computation 
times, an estimate of the fraction of cycles with different sizes 
was proposed fl9il . using the degree distribution P(k) and the 
standard cluster coefficient distribution C 3 (k). However, this 
estimation yields a lower bound for the total number of cycles 
and cannot be applied to bipartite networks, as shown below. 
In this Section we show that by using both C3 and C4 one 
is able to improve that estimation, being suitable at the same 
time to either monopartite and bipartite networks. 

The estimation in Ref. fl9ll considers the set of cycles with 
a central node, i.e. cycles with one node connected to all other 
nodes composing the cycle. Figure|5Ji illustrates one of such 
cycles, where the central node and each pair of its consecutive 
neighbors forms a triangle, in a total amount of four adjacent 
triangles. In such set of cycles, to estimate the number of 
cycles with size s one looks to the central node of each cy- 
cle which has a number, say k, of neighbors. The number of 
different possible cycles to occur is no(s, k) = ( *jj , 
since one has ( _ 1 ) different groups of s nodes and in each 
one of these groups there are (s — l)!/2 different ways in or- 
dering the s nodes into a cycle. The fraction of no(s, k) of cy- 
cles which is expected to occur is po(s, k) = C 3 (k) s ~ 2 , since 
the probability of having one edge between two consecutive 
neighbors is C 3 (k) and one must have s — 2 edges between 




FIG. 5: Illustrative examples of cycles (size s — 6) where the most 
connected node (o) is connected to (a) all the other nodes composing 
the cycle, forming four adjacent triangles. In (b) the most connected 
node is connected to all other nodes except one, forming two trian- 
gles and one sub-cycle of size s — 4, while in (c) the same cycle 
s = 6 encloses two sub-cycles of size s = 4 and no triangles (see 
text). 



the s — 1 neighbors. Therefore, the number of cycles of size s 
is estimated as 

N s = Ng s 2 P(k)n (s,k)p {s,k), (3) 

k=s-l 

where P(k) is the degree distribution and g s is a factor which 
takes into account the number of repeated cycles. 

The estimation in Eq. (J3j> is a lower bound for the total num- 
ber of cycles since it considers only cycles with a central node. 
For instance, in Fig.|5j5 while cycles of size s = 4 can be esti- 
mated with Eq. 0, the cycle s = 6 cannot since it has no cen- 
tral node, and in Fig. |5J; the above equation cannot estimate 
any cycle of any size. In fact, Fig. |3J; illustrates the type of 
cycles appearing in bipartite networks, where no triangles are 
observed. For such cycles C 3 (k) = and therefore all terms 
in Eq. (|3} vanish yielding a wrong estimation of the number 
of cycles. 

To take into account cycles without central nodes (Figs.[5p 
and|5j;), one must consider the clustering coefficient C 4 {k) 
defined in Eq. (0. One first considers the set of cycles 
of size s with one node (o) connected to all the others ex- 
cept one, as illustrated in Fig. |5p. In this case, since there 
are s — 2 nodes connected to node o one has n\{s, k) = 
( s ^ 2 )(s — 2)!/2 different possible cycles of size s, with k 
the number of neighbors of node o. The fraction of the 
ni(s, k) cycles which is expected to be observed is given by 
p x (s, k) = C 3 (k) s - 4 C 4 (k)(l - C 3 (fc)), since the probability 
of having s — 4 connections among the s — 2 connected nodes 
is C3(fc) s_4 , the probability that a pair of neighbors of node 
o has to share a common neighbor (different from node o) is 
Ci(k) and the probability that these same pair of neighbors 
have to be not connected is (1 — Cs(k)). Writing an equation 
similar to Eq. Q, where instead of nn(s, k) and po(s, k) one 
has m(s,k) and pi(s,k) respectively and the sum starts at 
s — 2 instead of s — 1, one has an additional number N' s of es- 
timated cycles which are not considered in estimation 0. No- 
tice that, since for N' s one considers at least one sub-cycle of 
size s = 4, this additional estimation contributes only for the 
estimation of cycles with size s > 4. We call henceforth sub- 
cycle, a cycle which is enclosed in a larger cycle and which 
do not enclose itself any shorter cycle. 

Still, the new estimation N s + N' s is not suitable for bi- 
partite networks, since it yields nonzero estimation only for 
s = 4. To improve the estimation further one must con- 
sider not only cycles composed by one single sub-cycle of 
size s = 4, as done in the previous paragraph, but also cy- 
cles with any number of sub-cycles of size s = 4. Figure 
|3J; illustrates a cycle of size s = 6 composed by two sub- 
cycles of size 4. In general, following the same approach 
as previously, for cycles composed by q sub-cycles of size 
4 one finds n q (s,k) = ^- s ~ q ~ 1 ^ ( s _^_ 1 ) possible cycles of 
size s looking from a node with k neighbors and a fraction 
p q (s,k) = C 3 (ky- 2 i- 2 C 4 {k) q (l - C 3 (k)) q of them which 
are expected to be observed. For q = one considers cycles 
as the one illustrated in Fig.|5ji, while for q = 1 and q = 2 one 
considers the set of cycles with one and two sub-cycles with 
size 4, as illustrated in Figs. |5j) and[5j; respectively. Summing 



100 



200 



50 



100 



FIG. 6: Estimating the number of cycles using Eq. dashed 
lines, and Eq. 0, solid lines. Here we impose a degree distribu- 
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FIG. 7: Estimating the number of cycles for the agent model using 
Eq. @ for N = 1000 (solid lines), N = 5000 (dashed lines) and 
N = 10000 (dotted lines) in (a) a monopartite network and in (b) a 
bipartite network, both obtained with the agent model. 



up over k and q yields our final expression 

[s/2]-l k max 

N s = Ng s J2 P(k)n q (s,k)p q (s,k). (4) 

q=0 k=s—q—l 

where [x] denotes the integer part of x. In particular, the first 
term (q = 0) is the sum in Eq. Q. The upper limit [s /2] — 1 of 
the first sum results from the fact that the exponent of C3 (k) in 
p q (s, k) must be non-negative: s — 2q — 2 > 0. The estimation 
in Eq. (0) not only improves the estimated number computed 
from Eq. but also enables the estimation of cycles up to a 
larger maximal size. In fact, since in the binomial coefficient 
( of Eq. (0 one must have s — 1 < k < k max , one 
only estimates cycles of size up to k max + 1, while in Eq. (|4} 
the maximal size is 2k max , as can be concluded using both 
conditions s — 2q — 2 > and s — q — 1 < k max . 

Figure|5]compares two cases treated in Ref. Ilfal . both with 
a degree distribution P(k) = -Prjfc -7 and coefficient distribu- 
tions C?,{k) = C^k~ a , using one value of a < 1 (Fig.|6^) 
and another one a > 1 (Fig. HJ>). Dashed lines indicate the 
estimation done with Eq. 0, while solid lines indicate the es- 
timation done with Eq. (|4}. In both cases, the latter estimation 
is larger. For a < 1 the difference between both estimations 
decreases with the size s of the cycle. For a > 1 the differ- 
ence between the estimations increases with s beyond a size 
s* < k max . Clearly, from Fig.|6p one sees that k max + 1 is 
the larger cycle size for which Eq. (|3} can give an estimation, 
while for Eq. (|4} the estimation proceeds up to 2k max (par- 
tially shown). In both cases, the typical size for which N s at- 
tains a maximum is numerically the same for both estimations, 
as expected. Moreover, for a > 1 (Fig. |6j?), beyond a size of 
the order of k max , N s /(Ng s ) in Eq. (|4} decreases exponen- 
tially with s, and not as a cutoff as observed for Eq. Q. In 
fact, the deviation of Eq. l|3} from the exponential tail, is due 
to the fact that for very large cycle sizes (s ~ k max ) Eq. 
can only consider very few terms in its sum. 

Another advantage of the estimation in Eq. @ is that it esti- 
mates cycles in bipartite networks. For bipartite network there 



are no connections between the neighbors, i.e. all subgraphs 
are similar to the one illustrated in Fig.|3J;. Therefore all terms 
in Eq. @ vanish except those for which the exponent of C3 (k) 
is zero, i.e. for s = 2(q + 1). Consequently, since q is an inte- 
ger, Eq. @ shows clearly that in bipartite networks there are 
only cycles of even size, as already known 1 8 ]. Moreover, sub- 
stituting q = (s — 2)/2 in Eq. yields a simple expression 
for the number of cycles in bipartite networks, namely 

k 

Nf m = N 9s P^^l^Cm* 12 - 1 - (5) 

k=s/2 ' 

Figure shows distribution of the fraction N s /(Ng s ) of 
cycles as a function of their size s, for a monopartite net- 
work (Fig. and a bipartite network (Fig. 05) composed 
by N = 1000, 5000 and 10000 nodes. This networks were 
generated from the agent model described in the previous sec- 
tion. Here, while monopartite networks show an exponential 
tail preceded by a region where the number of cycles is large, 
bipartite networks are composed by cycles whose number de- 
pend exponentially of their size. Furthermore one observes a 
clear transition for a characteristic size, which seems to scale 
with the network size. 

It is important to notice that triangles and squares may ap- 
pear in any multipartite network (except in bipartite ones, 
where triangles are absent). Therefore, the estimation de- 
scribed and studied in this Section can be applied not only 
to bipartite networks but to any multipartite network of any 
order. 



V. DISCUSSION AND CONCLUSIONS 

We introduced a clustering coefficient similar to the stan- 
dard one, which instead of measuring the fraction of triangles 
in a network measures the fraction of squares, and showed that 
with this clustering coefficient it is also possible to character- 
ize topological features in complex networks, usually done 
with the standard coefficient. We showed explicitly that the 
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range of values of the probability to acquire long-range con- 
nections in small-world networks and the typical clustering 
coefficient distributions of either random scale-free and hier- 
archical networks are approximately the same. In addition, 
we showed that this second clustering coefficient enables one 
to quantify the cliquishness in bipartite networks where trian- 
gles are absent. Thus, one should take triangles and squares 
simultaneously as the two basic cycle units in any network. 

An application of both clustering coefficients was proposed, 
namely to estimate the number of cycles in any network, either 
monopartite or multipartite. Using a recent estimation which 
yields a lower bound of the number of cycles in monopartite 
network up to a size s < k max + 1 where k max is the maxi- 
mum number of neighbors in the network, we deduce a more 
general expression which not only improves the previous esti- 
mation but is also suitable for bipartite networks and enables 
one to estimate cycles of size up to 2k max . Furthermore, in 
the particular case of bipartite networks our estimation yields 
as a natural consequence that only cycles of even size may 
appear. 

We also studied a concrete example of two sexual networks, 
one where only heterosexual contacts occur (bipartite net- 
work) and another with homosexual contacts (monopartite). 



The results obtained with the two real networks were com- 
pared with the ones obtained with a scale-free network and 
with an agent model recently introduced. Our results em- 
phasize that, in general, the agent model seems to be more 
suitable to reproduce networks of sexual contacts than the 
standard approach with random scale-free networks. Further- 
more, our results for the clustering distribution of both real 
sexual networks gave some evidence that the clustering abil- 
ity in sexual networks probably does not depend on the type 
of sexual contact (homosexual or heterosexual). To strengthen 
this hypothesis it is necessary to use larger networks of sexual 
contacts and apply the topological quantities here described. 
These and other questions are being analyzed in more detail 
and will be presented elsewhere. 
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